Search CORE

1,525 research outputs found

Nextprot: the new human protein knowledge resource

Author: Bairoch A.
Publication venue: Universidad de Córdoba, Servicio de Publicaciones
Publication date: 01/01/2011
Field of study

Comunicaciones a congreso

Repositorio Institucional de la Universidad de Córdoba

Suggestion to research groups working on protein and peptide sequence

Author: A Bairoch
Publication venue: 'Portland Press Ltd.'
Publication date
Field of study

Crossref

Distance, dissimilarity index, and network community structure

Author: A. Bairoch
C. von Mering
C.M. Deane
E. Ravasz
H. Zhou
H.W. Mewes
Haijun Zhou
I. Xenarios
L.C. Freeman
M. Girvan
P. Uetz
W.W. Zachary
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2003
Field of study

We address the question of finding the community structure of a complex network. In an earlier effort [H. Zhou, {\em Phys. Rev. E} (2003)], the concept of network random walking is introduced and a distance measure defined. Here we calculate, based on this distance measure, the dissimilarity index between nearest-neighboring vertices of a network and design an algorithm to partition these vertices into communities that are hierarchically organized. Each community is characterized by an upper and a lower dissimilarity threshold. The algorithm is applied to several artificial and real-world networks, and excellent results are obtained. In the case of artificially generated random modular networks, this method outperforms the algorithm based on the concept of edge betweenness centrality. For yeast's protein-protein interaction network, we are able to identify many clusters that have well defined biological functions.Comment: 10 pages, 7 figures, REVTeX4 forma

arXiv.org e-Print Archive

Crossref

MPG.PuRe

ORENZA: a web resource for studying ORphan ENZyme activities

Author: A Bairoch
A Bairoch
A Fleischmann
Bernard Labedan
DG Naumoff
DL Wheeler
HM Berman
I Schomburg
M Kanehisa
ML Green
N Hulo
O Lespinet
O Lespinet
Olivier Lespinet
PD Karp
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Despite the current availability of several hundreds of thousands of amino acid sequences, more than 36% of the enzyme activities (EC numbers) defined by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) are not associated with any amino acid sequence in major public databases. This wide gap separating knowledge of biochemical function and sequence information is found for nearly all classes of enzymes. Thus, there is an urgent need to explore these sequence-less EC numbers, in order to progressively close this gap. DESCRIPTION: We designed ORENZA, a PostgreSQL database of ORphan ENZyme Activities, to collate information about the EC numbers defined by the NC-IUBMB with specific emphasis on orphan enzyme activities. Complete lists of all EC numbers and of orphan EC numbers are available and will be periodically updated. ORENZA allows one to browse the complete list of EC numbers or the subset associated with orphan enzymes or to query a specific EC number, an enzyme name or a species name for those interested in particular organisms. It is possible to search ORENZA for the different biochemical properties of the defined enzymes, the metabolic pathways in which they participate, the taxonomic data of the organisms whose genomes encode them, and many other features. The association of an enzyme activity with an amino acid sequence is clearly underlined, making it easy to identify at once the orphan enzyme activities. Interactive publishing of suggestions by the community would provide expert evidence for re-annotation of orphan EC numbers in public databases. CONCLUSION: ORENZA is a Web resource designed to progressively bridge the unwanted gap between function (enzyme activities) and sequence (dataset present in public databases). ORENZA should increase interactions between communities of biochemists and of genomicists. This is expected to reduce the number of orphan enzyme activities by allocating gene sequences to the relevant enzymes

Crossref

Springer - Publisher Connector

PubMed Central

Identification of Amino Acid Sequences with Good Folding Properties in an Off-Lattice Model

Author: A. Bairoch
A. Irbäck
A. Irbäck
A. ̆Sali
Anders Irbäck
C. Peterson
C.J. Camacho
Carsten Peterson
E. Marinari
F.H. Stillinger
Frank Potthast
G. Iori
H.S. Chan
J. Hertz
J.D. Bryngelson
M. Fukugita
N.A. Metropolis
N.D. Socci
S. Duane
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1997
Field of study

Folding properties of a two-dimensional toy protein model containing only two amino-acid types, hydrophobic and hydrophilic, respectively, are analyzed. An efficient Monte Carlo procedure is employed to ensure that the ground states are found. The thermodynamic properties are found to be strongly sequence dependent in contrast to the kinetic ones. Hence, criteria for good folders are defined entirely in terms of thermodynamic fluctuations. With these criteria sequence patterns that fold well are isolated. For 300 chains with 20 randomly chosen binary residues approximately 10% meet these criteria. Also, an analysis is performed by means of statistical and artificial neural network methods from which it is concluded that the folding properties can be predicted to a certain degree given the binary numbers characterizing the sequences.Comment: 15 pages, 8 Postscript figures. Minor change

arXiv.org e-Print Archive

CiteSeerX

Lund University Publications

Crossref

neXtProt: a knowledge platform for human proteins

Author: A. Bairoch
A. Britan
A. Gateau
A. Gleizes
A. Masselot
Altschul
Ashburner
Bairoch
C. Zwahlen
Deutsch
G. Argoud-Puy
Goel
I. Cusin
Kelso
L. Lane
Liebel
O. Evalet
Orchard
P. D. Duek
P. Gaudet
Sewell
Shannon
Simpson
Uhlen
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

neXtProt (http://www.nextprot.org/) is a new human protein-centric knowledge platform. Developed at the Swiss Institute of Bioinformatics (SIB), it aims to help researchers answer questions relevant to human proteins. To achieve this goal, neXtProt is built on a corpus containing both curated knowledge originating from the UniProtKB/Swiss-Prot knowledgebase and carefully selected and filtered high-throughput data pertinent to human proteins. This article presents an overview of the database and the data integration process. We also lay out the key future directions of neXtProt that we consider the necessary steps to make neXtProt the one-stop-shop for all research projects focusing on human proteins

Crossref

PubMed Central

Archive ouverte UNIGE

ExplorEnz: the primary source of the IUBMB enzyme list

Author: A. G. McDonald
Bairoch
K. F. Tipton
Kanehisa
McDonald
S. Boyce
Schomburg
Tipton
Publication venue: Oxford University Press
Publication date
Field of study

ExplorEnz is the MySQL database that is used for the curation and dissemination of the International Union of Biochemistry and Molecular Biology (IUBMB) Enzyme Nomenclature. A simple web-based query interface is provided, along with an advanced search engine for more complex Boolean queries. The WWW front-end is accessible at http://www.enzyme-database.org, from where downloads of the database as SQL and XML are also available. An associated form-based curatorial application has been developed to facilitate the curation of enzyme data as well as the internal and public review processes that occur before an enzyme entry is made official. Suggestions for new enzyme entries, or modifications to existing ones, can be made using the forms provided at http://www.enzyme-database.org/forms.php

Crossref

PubMed Central

Network Landscape from a Brownian Particle's Perspective

Author: A. Bairoch
A.-L. Barabási
B. Tadić
C.M. Deane
D.J. Watts
E. Ravasz
E. Selkov Jr.
H. Jeong
Haijun Zhou
I. Xenarios
L.C. Freeman
M. Faloutsos
M. Girvan
M.E.J. Newman
R. Albert
R. Guimerà
S.H. Strogatz
S.N. Dorogovtsev
T.A. Davis
W.W. Zachary
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2003
Field of study

Given a complex biological or social network, how many clusters should it be decomposed into? We define the distance

d_{i,j}

from node

i

to node

j

as the average number of steps a Brownian particle takes to reach

j

from

i

. Node

j

is a global attractor of

i

d_{i,j}\leq d_{i,k}

for any

k

of the graph; it is a local attractor of

i

, if

j\in E_i

(the set of nearest-neighbors of

i

) and

d_{i,j}\leq d_{i,l}

for any

l\in E_i

. Based on the intuition that each node should have a high probability to be in the same community as its global (local) attractor on the global (local) scale, we present a simple method to uncover a network's community structure. This method is applied to several real networks and some discussion on its possible extensions is made.Comment: 5 pages, 4 color-figures. REVTeX 4 format. To appear in PR

arXiv.org e-Print Archive

Crossref

MPG.PuRe

A Re-Annotation of the Saccharomyces Cerevisiae Genome

Author: A Ivens
Altschul
B. Barrell
Bairoch
Bairoch
Bateman
Berbee
Birney
Blandin
DeRisi
Dujon
Gaillardin
Goffeau
Hieter
K. M. Rutherford
Lowe
M-A Rajandream
Mackiewicz
Malpertuy
Mewes
Oliver
Oliver
Pearson
Rutherford
Sharp
Sonnhammer
Stoesser
V. Wood
Xiang
Zhang
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2001
Field of study

Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation)

Crossref

Directory of Open Access Journals

PubMed Central

ProRule: a new database containing functional and structural information on PROSITE profiles

Author: Bairoch Amos
De Castro Edouard
Hulo Nicolas
Langendijk-Genevaux Petra S.
Le Saux Virginie
Sigrist Christian J. A.
Publication venue
Publication date: 02/08/2017
Field of study

Motivation: Increase the discriminatory power of PROSITE profiles to facilitate function determination and provide biologically relevant information about domains detected by profiles for the annotation of proteins. Summary: We have created a new database, ProRule, which contains additional information about PROSITE profiles. ProRule contains notably the position of structurally and/or functionally critical amino acids, as well as the condition they must fulfill to play their biological role. These supplementary data should help function determination and annotation of the UniProt Swiss-Prot knowledgebase. ProRule also contains information about the domain detected by the profile in the Swiss-Prot line format. Hence, ProRule can be used to make Swiss-Prot annotation more homogeneous and consistent. The format of ProRule can be extended to provide information about combination of domains. Availability: ProRule can be accessed through ScanProsite at http://www.expasy.org/tools/scanprosite. A file containing the rules will be made available under the PROSITE copyright conditions on our ftp site (ftp://www.expasy.org/databases/prosite/) by the next PROSITE release. Contact: [email protected]

RERO DOC Digital Library